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Observing H -^ W^*^W^*^ -^ e^jj^^^r 

in weak boson fusion with dual forward jet tagging 

at the CERN LHC 

D. Rainwater and D. Zcppcnfcld 
Department of Physics, University of Wisconsin, Madison, WI 53706 

Weak boson fusion promises to be a copious source of intermediate mass Stan- 
dard Model Higgs bosons at the LHC. The additional very energetic forward jets 
in these events provide for powerful background suppression tools. We analyze the 
H —^ M^(*)l/l^(*) -^ e^jX^'pT decay mode for a Higgs boson mass in the 130-200 GeV 
range. A parton level analysis of the dominant backgrounds (production of W pairs, tt 
^\ • and Z ^ rr in association with jets) demonstrates that this channel allows the observa- 

0^ i tion of _ff — > T/1/(*)T/1/(*' in a virtually background-free environment, yielding a significant 

On ■ Higgs boson signal with an integrated luminosity of 5 fb^^ or less. Weak boson fusion 

" ' achieves a much better signal to background ratio than inclusive H -^ e h^-^t and is 

O i therefore the most promising search channel in the 130-200 GeV mass range. 

(D ■ 

Q: 

^ ■ I. INTRODUCTION 

The search for the Higgs boson and, hence, for the origin of electroweak symmetry breaking and fermion mass 
vj i generation, remains one of the premier tasks of present and future high energy physics experiments. Fits to 
precision electroweak (EW) data have for some time suggested a relatively small Higgs boson mass, of order 
100 GeV M. This is one of the reasons why the search for an intermediate mass Higgs boson is particularly 
p^ ■ important^. 

^sO For the intermediate mass range, most of the literature has focussed on Higgs boson production via gluon 

^D fusion M and ttH or WH{ZH) H] associated production. Cross sections for Standard Model (SM) Higgs 

ON boson production at the LHC are well-known |3], and while production via gluon fusion has the largest cross 

i^ , section by almost one order of magnitude, there are substantial QCD backgrounds. A search for the very clean 

f~i ' four-lepton signature from H -^ ZZ decay can find a Higgs boson in the mass region mn ^130 GeV, but due to 

Oh!^ the small branching fraction of this mode very large integrated luminosities, up to 100 fb ^^ or more, are required. 

f-^ • One can search for gg ^ H via H -^ T4/(*)vi/(*) — + e^ii^-^r decays with much lower luminosity [g|-|7|j but with 

qj ] lower signal-to-background ratios. 

f^ I The second largest production cross section is predicted for weak-boson fusion (WBF), qq -^ qqVV -^ qqH ■ 
IJ ' These events contain additional information in their observable quark jets. Techniques like forward jet tag- 
• ^H , ging Iq-UQ] can then be exploited to significantly reduce the backgrounds. WBF and gluon fusion nicely comple- 
/\ ' ment each other: together they allow for a measurement of the ttH /WW H coupling ratio. 

Another feature of the WBF signal is the lack of color exchange between the initial-state quarks. Color coherence 
between initial- and final-state gluon bremsstrahlung leads to suppressed hadron production in the central region, 
between the two tagging-jet candidates of the signal [O. This is in contrast to most background processes, which 
typically involve color exchange in the i-channel and thus lead to enhanced hadronic activity between the tagging 
jets. We exploit these features, via a veto of soft jet activity in the central region [R. 

While some attention has been given to intermediate- mass H — > Vt^(*)vt^(*) searches at the LHC in the frame- 
work of gluon fusion g,^ , production via weak boson fusion for the same decay mode has not yet been discussed 
in the literature. Thus, we provide a first analysis of intermediate-mass VV ^ H ^ Vl/(*)Vl^(*) at the LHC (and 
of the main physics and reducible backgrounds) which demonstrates the feasibility of Higgs boson detection in 
this channel, with very low luminosity. H — > T4/(*)T4/(*) event characteristics are analyzed for dual leptonic decays 
to e^ ix^ only, to avoid backgrounds from Z, 7 ^ e^e^ ,11^11^ . 

Our analysis is a parton-level Monte Carlo study, using full tree-level matrix elements for the WBF Higgs 
signal and the various backgrounds. In Section O we describe our calculational tools, the methods employed in 
the simulation of the various processes, and important parameters. Extra minijet activity is simulated by adding 
the emission of one extra parton to the basic signal and background processes. Generically we call the basic signal 
process (with its two forward tagging jets) and the corresponding background calculations "2-jet" processes, and 



refer to the simulations with one extra parton as "3-jet" processes. In Section [II, using the 2-jet programs for 



the backgrounds, we demonstrate forward jet tagging, a b veto and other important cuts which combine to yield 



an « 2/1 to 1/2 signal-to-background (S/B) ratio, depending on the Higgs mass. 



In Section IV we analyze the different minijet patterns in signal and background, using both the truncated 
shower approximation (TSA) ||T^ to regulate the cross sections, and the gluon exponentiation model to estimate 
the minijet multiplicity [ |l3| . By exploiting the two most important characteristics of the extra radiation, its 
angular distribution and its hardness, the QCD backgrounds can be suppressed substantially by a veto on extra 
central jet emission. Within the TSA and exponentiation models, probabilities are estimated for vetoing signal and 
background events, and are combined with the production cross sections of the previous section to predict signal 
and background rates in Table y. These rates demonstrate the feasibility of extracting a very low background 
H -^ H/(*)h/(*) signal at the LHC. 

Our signal selection is not necessarily optimized yet. The variables we identify for cuts are the most distinctive, 
but deserve a multivariate analysis with detector simulation. We do construct an additional variable in Section M 
which is not used for cuts, but rather can be used to extract the Higgs boson mass from the final event sample. 

II. CALCULATIONAL TOOLS 

We simulate pp collisions at the CERN LHC, ^ = 14 TeV. All signal and background cross sections are 
determined in terms of full tree level matrix elements for the contributing subprocesses and are discussed in more 
detail below. 

For all our numerical results we have chosen svc^Oy/ = 0.2315, Mz = 91.19 GeV, and Gp — 1.16639 • 
10~^ GeV"^, which translates into Mw = 79.94 GeV and a{Mz) = 128.74 when using the tree-level rela- 
tions between these input parameters. This value of Mw is somewhat lower than the current world average 
of « 80.35 GeV. However, this difference has negligible effects on all cross sections, e.g. the qq — > qqH signal 
cross section varies by about 0.5% between these two W mass values. The tree level relations between the input 
parameters are kept in order to guarantee electroweak gauge invariance of all amplitudes. For all QCD effects, 
the running of the strong coupling constant is evaluated at one-loop order, with as{Mz) = 0.118. We employ 
CTEQ4L parton distribution functions jlj] throughout. Unless otherwise noted the factorization scale is chosen 
as Hf — min(pT) of the defined jets. 

A. The qq -^ qqH{g) signal process 

The signal can be described, at lowest order, by two single-Feynman-diagram processes, qq -^ qq{WW, ZZ) -^ 
qqH, i.e. WW and ZZ fusion where the weak bosons are emitted from the incoming quarks ||l^. Because of 
the small Higgs boson width in the mass range of interest^ these events can reliably be simulated in the narrow 
width approximation. From previous studies of iJ ^ 77 p6| and H ^ tt [n^ decays in weak boson fusion we 
know several features of the signal, which can be exploited here also: the centrally produced Higgs boson tends 
to yield central decay products (in this case M^+VF"), and the two quarks enter the detector at large rapidity 
compared to the M^'s and with transverse momenta in the 20 to 100 GeV range, thus leading to two observable 
forward tagging jets. 

For the study of a central jet veto, we utilize the results of previous studies where we simulated the emission of 
at least one extra parton |l^,n8]. This was achieved by calculating the cross sections for the process qq -^ qqHg, 
i.e. weak boson fusion with radiation of an additional gluon, and all crossing related processes. 

An important additional tool for distinguishing the H -^ e^fi^-^T signal from various backgrounds is the anti- 
correlation of the W spins, as pointed out in Ref. . This is due to the preservation of angular momentum in 
the decay of the spin-0 Higgs boson. Of course, we can observe only the angular distributions of the charged 
decay leptons, but this is sufficient. The decay rate is proportional to {pg- ■ Pv)(j>i+ ' Pu)- In the rest frame of 
the Higgs boson, in which the e^v or e'^v pairs are emitted back-to-back for W^W~ production at threshold, 
this product is a maximum for the charged leptons being emitted parallel. This characteristic is preserved and 
even enhanced when boosted to the lab frame, as the Higgs boson in weak boson fusion is typically emitted with 
PT « 60 - 120 GeV. 

B. The QCD ti^ jets backgrounds 

Given the H decay signature, the main physics background to our e ji^^t signal arises from ti+jets production, 
due to the large top production cross section at the LHC and because the branching ratio B{t ^ Wb) is essentially 



100%. 

The basic process we consider is pp — > tt, which can be either gg- or gg-initiated, with the former strongly 
dominating at the LHC. QCD corrections to this lead to additional real parton emission, i.e. to ti + j events. 
Relevant subprocesses are 

gq^ttq, gq ^ ttq , qq ^ ttg , gg -^ tig , (1) 

and the subprocesses for tt + jj events can be obtained similarly. For the case of no additional partons, the 6's 
from the decaying top quarks may be identified as the tagging jets. In this case, calculating the cross section 
for ti + j where the 6's are explicitly identified as the tagging jets serves to estimate the effect of additional soft 
parton emission, i.e. minijet activity in the central detector; this is described in detail in Sec. f^/\ . At the same 
time, we can identify a distinctly different, perturbative region of phase space, where the final-state light quark 
or gluon gives rise to one tagging jet, and one of the two decay 6's is identified as the other tagging jet. In this 
case, ti + jj may be used to estimate minijet activity for the hard process pp — > ti + j. Finally, there is a third 
distinct region of phase space, for the perturbative hard process pp —^ ti + jj, where the final state light quarks 
or gluons are the two tagging jets. 

Thus, the "ttj" and "ttjj" calculations serve a dual purpose: to obtain the cross sections for the contribution 
of the perturbative processes where light quark or gluon jets lie in the region of phase space where they are 
experimentally identified as far-forward/backward tagging jets; and to estimate the additional QCD radiation 
patterns for the next- lower-order perturbative ti+jets process. The tt and tij matrix elements were constructed 
using MadGraph |Q , while the tijj matrix elements are from Ref. [gOJ . 

Decays of the top quarks and VF's are included in the matrix elements; however, while the VF's are allowed 
to be off-shell, the top quarks are required to be on-shell. Energy loss from b —> ivX is included to generate 
more accurate ^t distributions. In all cases, the factorization scale is chosen as /ij — uAti{Et) of the massless 
partons/top quarks. The overall strong couphng constant factors are taken as (as)" — n"=i Q^s(^Ti), where the 
product runs over all light quarks, gluons and top quarks; i.e. the transverse momentum of each additional parton 
is taken as the relevant scale for its production, irrespective of the hardness of the underlying scattering event. 
This procedure guarantees that the same a1 factors are used for the hard part of a ti+jets event, independent 
of the number of additional minijets, and at the same time the small scales relevant for soft-parton emission are 
implemented. 

C. The QCD WW + jj background 

The next obvious background arises from real-emission QCD corrections to W~^W~ production. For W^W^jj 
events these background processes include [gl[ 

qg -^ qgW+W- , qq' -^ qq'W+W' , (2) 

which are dominated by i-channel gluon exchange, and all crossing related processes, such as 

qq ^ ggW+W- , gg ~* qqW+W' . (3) 

We call these processes collectively the "QCD WWjj" background. We do not calculate cross sections for the 
corresponding WW + 3-jet processes, but instead follow the results of our analysis of the radiation patterns of 



QCD Z + jets processes, detailed in Sec. ^, and apply those results here to estimate minijet veto probabilities. 

The factorization scale is chosen as for the Higgs boson signal. The strong coupling constant factor is taken as 
(tts)^ = as{pTi)ctsiPT2)j i-^-i the transverse momentum of each additional parton is taken as the relevant scale 
for its production. Variation of the scales by a factor 2 or ^ reveals scale uncertainties of « 35%, however, which 
emphasizes the need for experimental input or NLO calculations. 

The WW background lacks the marked anti-correlation of W spins seen in the signal. As a result the momenta 
of the charged decay leptons will be more widely separated than in H —> VF(*)h^(*) events. 

D. The EW WW + jj background 

These backgrounds arise from W^W~ bremsstrahlung in quark-(anti)quark scattering via i-channel elec- 
troweak boson exchange, with subsequent decay W~^W~ -^ £^£~-^t- 



qq' -^ qq'W+W- (4) 

Naively, this EW background may be thought of as suppressed compared to the analogous QCD process in Eq. (g). 
However, it includes electroweak boson fusion, VV -^ W^W~ via s- or i-channel 7/Z-exchange or via WW 
4-point vertices, which has a momentum and color structure identical to the signal. Thus, it cannot easily be 
suppressed via cuts. 

The matrix elements for these processes were constructed using MadGraph |19(| . We include charged-current 
(CC) and neutral-current (NC) processes, but discard s-channel EW boson and t-channel quark exchange processes 
as their contribution was found to be « 1% only, while adding significantly to the CPU time needed for the 
calculation. In general, for the regions of phase space containing far-forward and -backward tagging jets, s-channel 
processes are severely suppressed. We refer collectively to these processes as the "EW WWjj" background. Both 
Ws are allowed to be off-shell, and all off-resonance graphs are included. In addition, the Higgs boson graphs 
must be included to make the calculation well-behaved at large VF-pair invariant masses. However, these graphs 
include our signal processes and might lead to double counting. Thus, we set rriH to 60 GeV in the EW WWjj 
background to remove their contribution. A clean separation of the Higgs boson signal and the EW WWjj 
background is possible because interference effects between the two are negligible for the Higgs boson mass range 
of interest. 

Again we will need an estimate of additional gluon radiation patterns. This was first done for EW processes in 
Ref. p2|, but for different cuts on the hard process, and again for EW rrjj processes in Rcf. m%. We reanalyze 



the EW TTJj case in Sec. IV and directly apply the resulting minijet emission probabilities here. The EW ttjj 
and EW WWjj backgrounds are quite similar kinematically, which justifies the use of the same veto probabilities 
for central jets. 

E. The QCD and EW t+t- backgrounds 

The leptonic decay of r's provides a source of electrons, muons and neutrinos which can be misidentified as W 
decays. Thus, we need to study real-emission QCD corrections to the Drell-Yan process qq — > {Z,j) — > t^t^. 
For T^T^jj events these background processes include [E3| 

qg -^ qgT^T' , qq' -^ qq'T+T~ , (5) 

which are dominated by t-channel gluon exchange, and all crossing-related processes, such as 

qq -* 99T^T^ , 99 -* qqr+T^ . (6) 

All interference effects between virtual photon and Z-exchange are included. We call these processes collectively 
the "QCD TTJj" background. The cross sections for the corresponding tt + 3-jet processes, which we need for 
our modeling of minijet activity in the QCD ttjj background, have been calculated in Refs. ||24|-[2q|. Similar to 
the treatment of the signal processes, we use a parton-level Monte-Carlo program based on the work of Ref. pSf l 
to model the QCD ttjj and ttjjj backgrounds. 

From our study oi H ^ tt in weak boson fusion ||l^ , we know that the EW (t-channel weak boson exchange) 
cross section will be comparable to the QCD cross section in the phase space region of interest. Thus, we consider 
those processes separately, in a similar manner as for the EW WWjj contribution. We use the results of Ref. p^ 
for modeling the EW ttjj background. 

The dual leptonic decays of the r's are simulated by multiplying the t^t^ jj cross section by a branching 
ratio factor of (0.3518)^/2 and by implementing coUincar tau decays with hclicity correlations included as in our 
previous analysis oi H ^ tt fl^ . 

F. Detector resolution 

The QCD processes discussed above lead to steeply falling jet transverse momentum distributions. As a result, 
finite detector resolution can have a sizable effect on cross sections. These resolution effects are taken into account 
via Gaussian smearing of the energies of jets/6's and charged leptons. We use 



AE _ 5.2 0.4 



.009 , (7) 



for jets (with individual terms added in quadrature), based on ATLAS expectations |g^. For charged leptons we 

= 2% . (8) 
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In addition, finite detector resolution leads to fake niissing-transverse-momentum in events with hard jets. An 
ATLAS analysis [Q showed that these effects are well parameterized by a Gaussian distribution of the components 
of the fake missing transverse momentum vector, ^t, with resolution 



a{i)^,i)y) = 0.46 • ^J2^T.had , (9) 

for each component. In our calculations, these fake missing transverse momentum vectors are added linearly to 
the neutrino momenta. 

III. HIGGS SIGNAL AND REAL W+W- BACKGROUNDS 

The qq -^ qqH, H -^ VF'^*^VF*^*-' -^ e^fM^vP dual leptonic decay signal is characterized by two forward jets 
and the W decay leptons (e,/i). Before discussing background levels and further details like minijet radiation 
patterns, we need to identify the search region for these hard Hjj events. The task is identical to the Higgs 
searches in qq —> qqH, H — > ^^,tt which were considered previously |16|Jl7[] . We can thus adopt the strategy of 
these earlier analyses and start out by discussing a basic level of cuts on the qq -^ qqH, H -^ H^(*)h^(*) signal. 
Throughout this section we assume a Higgs mass of Mh = 160 GeV, but we do not optimize cuts for this mass. 

The minimum acceptance requirements ensure that the two jets and two charged leptons are observed inside 
the detector (within the hadronic and electromagnetic calorimeters, respectively), and are well-separated from 
each other: 

PT, > 20 GeV , \r]j\< 5.0 , ARjj > 0.7 , 

PT, >20GeV, \r]e\<2.5, ARje>0.7. (10) 

A feature of the QCD WWjj background is the generally higher rapidity of the VF's as compared to the Higgs 
signal: weak boson bremsstrahlung occurs at small angles with respect to the parent quarks, producing PF's 
forward of the jets. Thus, we also require both £'s to lie between the jets with a separation in pseudorapidity 
^Vj,i > O-"^: s-nd the jets to occupy opposite hemispheres: 

?7j,mm + 0.7 < 77^1 2 < T]j^„iax ^ 0.7 , TJj^ ■ TJj^ < (11) 

Finally, to reach the starting point for our consideration of the signal and various backgrounds, a wide separation 
in pseudorapidity is required between the two forward tagging jets, 

AT^taffs = hji - %-J > 4.4 , (12) 

leaving a gap of at least 3 units of pseudorapidity in which the ^'s can be observed. This technique to separate 
weak boson scattering from various backgrounds is well-established |7|- [lC| , [lq -[l8[ , in particular for heavy Higgs 
boson searches. Line 1 of Table | shows the effect of these cuts on the signal and backgrounds for a SM Higgs 
boson of mass mn = 160 GeV. Overall, about 28% of all H -^ w''*^W^*^ -^ e^^^vD events generated in weak 
boson fusion are accepted by the cuts of Eqs. ( p^fjl^ ) (for niH ~ 160 GeV). 

Somewhat surprisingly, the EW WWjj background rate reaches 2/3 of the QCD WWjj background rate 
already at this level. This can be explained by the contribution from W, Z, 7 exchange and fusion processes which 
can produce central W pairs and are therefore kinematically similar to the signal. This signal-like component 
remains after the forward jet tagging cuts. 

As is readily seen from the first line of Table ffl the most worrisome background is W pairs from tt -t- jets 
production. Of the 1080 fb at the basic cuts level, 12 fb are from tt, 310 fb are from tij, and the remaining 760 fb 
arise from ttjj production. The additional jets (corresponding to massless partons) are required to be identified 
as far forward tagging jets. The ttjj cross section is largest because the tt pair is not required to have as large 
an invariant mass as in the first two cases, where one or both 6's from the decay of the top quarks are required 
to be the tagging jets. 
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FIG. 1. Normalized invariant mass distribution of the two tagging jets for the signal (red) and various back- 
grounds: tt + jets (blue), QCD WWjj (solid green ), EW WWjj (solid purple), QCD tt + jj (dashed green) 
and EW tt 



- jj (dashed purple). The cuts of Eqs. (10-13|) are imposed 



For the events where one or both of the b's are not identified as the tagging jets, they will most frequently lie 
between the two tagging jets, in the region where we search for the W decay leptons. Vetoing events with these 
additional b jets provides a powerful suppression tool to control the top background. Note that this does not 
require a 6-tag, merely rejection of any events that have an additional jet, which in this case would be from a 
hadronically decaying b. We discard all events where a 6 or 6 jet with pr > 20 GeV is observed in the gap region 
between the tagging jets, 



PT, > 20GeV , 



^j,min ^ Vb ^ ^j,max ■ 



(13) 



This leads to a reduction of ttj events by a factor 7 while ttjj events are suppressed by a factor 100. This results 
in cross sections of 43 and 7.6 fb, respectively, at the level of the forward tagging cuts of Eqs. ( |lO[jl^ ), which are 
now comparable to the other individual backgrounds. This is shown in the second line of Tablell Note that the 
much higher b veto probability for tijj events results in a lower cross section than that for ttj events, an ordering 
which will remain even after final cuts have been imposed (see below). 

QCD processes at hadron colliders typically occur at smaller invariant masses than EW processes, due to the 
dominance of gluons at small Feynman x in the incoming protons. We observe this behavior here, as shown in 
Fig. n^. The three tt + jets backgrounds have been combined for clarity, even though their individual distributions 
are slightly different. We can thus significantly reduce much of the QCD background by imposing a lower bound 
on the invariant mass of the tagging jets: 



> 650 GeV 



(14) 



Another significant difference is the angular distribution of the charged decay leptons, e and /i^, relative to 
each other. In the case of the Higgs signal, the W spins are anti-correlated, so the leptons are preferentially 
emitted in the same direction, close to each other. A significant fraction of the various backgrounds does not have 
anti-correlated W spins. These differences are demonstrated in Fig. El which shows the azimuthal (transverse 
plane) opening angle, polar (lab) opening angle, and separation in the lego plot. We exploit these features by 
establishing the following lepton-pair angular cuts: 



i^efi 



< 105° 



cos ^e^ > 0.2 , 



AR,^ <2.2. 



(15) 

It should be noted that while these cuts appear to be very conservative, for higher Higgs boson masses the (pep, and 
Ai?e/i distribution broadens out to higher values, overlapping the backgrounds more. For ttih ~ 180 — 200 GeV 
these cuts are roughly optimized and further tightening would require greater integrated luminosity for discovery 
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FIG. 2. Normalized angular distributions of the charged leptons: aziniuthal opening angle, lab opening angle, 
and separation in the lego plot. Results are shown for a Higgs boson mass of 160 GeV and 190 GeV (solid and 
dashed red lines) and for the various backgrounds as in Fig. nl Lepton angular separation is clearly smaller for 
the rtiii = 160 GeV scenario. The cuts of Eqs. ([lO| - |l3|) are imposed. 

TABLE I. Signal rates a ■ B(H -^ e^[i^j)T) for Wff = 160 GeV and corresponding background cross sections, 
in -pp collisions at \fs — 14 TeV. Results are given for various levels of cuts and are labeled by equation numbers 
discussed in the text. On lines six the minijet veto is included. Line five gives the survival probabilities for each 
process, with p^"^*" — 20 GeV. The expected tagging jet identification efficiency is shown on the last line. All 
rates are given in fb. 
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at this upper end of the mass range. Because of the excellent signal-to-background ratio achieved below, we prefer 
to work with uniform acceptance cuts, instead of optimizing the cuts for specific Higgs boson mass regions. 

We also examine the distributions for lepton-pair invariant mass, TOg^t, and maximum lepton pT, as shown in 
Fig. H for the case mn = 160, 190 GeV. As is readily seen, the QCD backgrounds and EW WWjj background 
prefer significantly higher values for both observables. Thus, in addition to the angular variables, we find it useful 
to restrict the individual px of the leptons, as well as the invariant mass of the pair: 



TOe 



< 110 GeV , pn^ < 120 GeV . 



(16) 



These are particularly effective against the top backgrounds, where the large top mass allows for very high-pT 
leptons far from the tagging jets, and against the EW WWjj background, where the leptons tend to be well- 



separated in the lego 



Results after cuts (nj 



Again, the cuts are set quite conservatively so as not to bias a lower Higgs boson mass. 



- |16| ) are shown on the third line of Table |, for the case of a 160 GeV Higgs boson. 
At this level of cuts we now observe the combined QCD and EW rrjj backgrounds to remain large, contributing 
almost 40% of the total. We can take advantage of the fact that in these backgrounds, the Z or 7 is emitted with 
quite high pT, on the order of 100 GeV, which contributes to large r boosts and causes the r decay products to 
be nearly coUinear in the lab frame. Within the coUinear approximation, the r momenta can be reconstructed 
knowing the charged lepton momenta and the missing transverse momentum vector |17| , p9[ . Labeling by x^ , Xr^ 
the fraction of t energy each charged lepton takes with it in the t decay, 'pT,XTi>T,y can be used to solve the two 
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FIG. 3. Normalized distributions of the dilepton invariant mass and maximum charged lepton momentum after 
the cuts of Eqs. (p^p^. Results are shown for a Higgs boson mass of 160 GeV and 190 GeV (solid and dashed 
red lines) and for the various backgrounds as in Fig. |l|. The rriH = 160 GeV curve peaks at lower values of We^ 
and Pt«.„„, ■ 



equations (x,y transverse directions) for the two unknowns x^ 2 • For real r decays, the "^t vector must lie between 
the two leptons, and apart from finite detector resolution the reconstruction must yield < x^ 2 < 1- For the 
Hjj signal and other backgrounds, the coUinear approximation is not valid because the W's receive modest boosts 
in the lab only. In this case, the ^t vector will rarely lie between the two leptons, and an attempt to reconstruct 
a r pair will result in x^ < or Xr2 < for 95% of the events, n Additionally, the "r pair" invariant mass that is 
reconstructed does not peak at mz, even when it is positive. We can therefore apply a highly efhcient cut against 
the QCD and EW ttjj backgrounds by vetoing events where an attempt to reconstruct a r pair in the coUinear 
decay approximation results in two "real" r's near the Z pole: 



>0, 



mz - 25 GeV < nirr < mz + 25 GeV . 



(17) 



The results of this final cut are shown in line four of Table |. The t backgrounds are virtually eliminated, while 
the signal and the other backgrounds each lose « 5%. 

IV. RADIATION PATTERNS OF MINUETS 

If we are to veto central b jets to reduce the ti+ jets background to a manageable level, we must take care to 
correctly estimate higher-order additional central partonic emission in the signal and backgrounds. Fortunately, 
due to the absence of color exchange between the two scattering quarks in EW processes, which includes our 
Hjj signal, we expect soft gluon emission mainly in the very forward and very backward directions. However, 
for QGD processes, which are dominated by t-channel color octet exchange, soft gluon radiation occurs mainly in 
the central detector. Thus, when we estimate additional central radiation with px > 20 GeV to match our b veto 
condition, we will reject QGD background events with much higher probability than the EW processes. Our b 
veto is then automatically also a minijet veto, a tool for QCD background suppression which has been previously 
studied in great detail for Hjj production at hadron colliders ||7|, [l7| , p2| . 



^Conversely, requiring x^-^ > 0, Xr2 > largely eliminates WW backgrounds and promises clean isolation of 



H 



=±„=F 



fj-'^h 



Largely following the analysis of Ref. ||T^ for the analogous EW Zjj process which would be used to "calibrate" 
the tool at the LHC, we veto additional central jets in the region 



PtT > PT^..to , (18a) 

tag veto tag Qg^^x 



where pr.veto may be chosen based on the capability of the detector. For the LHC we take this to be 20 GeV. 

For pT,vcto ~ 40 GeV we are already leaving the validity range of fixed-order perturbation theory in QCD 
processes: the cross section with one additional parton starts to exceed the hard tree-level cross section. As a result 
it becomes difficult to provide reliable theoretical estimates of minijet emission rates for the QCD backgrounds. 
However, gluon emission is governed by very different scales in signal as compared to background processes, due 
to their different color structures. Thus, a parton shower approach does not immediately give reliable answers 
unless both color coherence and the choice of scale are implemented correctly, matching the answer given by QCD 
matrix elements for sufficiently hard partons. 

While the necessary information on angular distributions and hardness of additional radiation is available in 
the "3-jet" and tt + jets processes discussed in Section ||, we must either regulate or reinterpret these divergent 
cross sections. We use the truncated shower approximation (TSA) |jl3] for the former, treating the "2-jet" cross 
sections as the inclusive rate. Details of this procedure can be found in Refs. |17| , |l8| , but here we improve upon the 
determination of veto probabilities. In our previous studies, TSA matching was performed without enforcement 
of the forward tagging cuts of Eqs. ( [ll| , p^ , even though tagging jet candidates were chosen for the purpose of 
ide ntify ing the veto candidate; tagging jet candidates were selected as the two most energetic |lq| or two highest- 
Pt Hj defined jets {px > 20 GeV), in opposite detector hemispheres. Without the additional forward tagging 
cuts, in particular without the large rapidity separation of the two tagging jets, this favors QCD background 
events with high pT central quark jets which in turn lead to a harder gluon emission spectrum than is present 
after all final cuts are imposed. A harder gluon spectrum goes hand-in-hand, however, with an increased minijet 
emission probability. A more realistic estimate of the minijet px spectrum is obtained by applying the matching 
condition (or calculating n) only in the phase space region where a comparison of signal and background will take 
place: after all acceptance cuts, determined at the two-jet level, have been imposed. 

Once the full level of cuts for a given search scenario are imposed, one may examine different tagging jet 
selection algorithms to optimize the veto. Ideally, the outgoing quarks would always be selected, so that the 
additional gluon radiation is always the veto candidate. In practice, this is impossible, but for the Higgs signal 
various algorithms can achieve "proper" quark tagging with about 75% efficiency, a high success rate. Briefly, 
these might be the two highest-pr jets, or the two jets closest to the reconstructed Higgs. Most algorithms have 
very little difference from each other in the case of the WBF signature. Thus, we choose an algorithm that allows 
more suppression of the QCD backgrounds. The final algorithm we chose is to select the highest-pT jet as the 
first tagging jet, since it will almost always be part of the hard scattering, and then select the other tagging jet 
such that the event is more likely to pass the forward tagging cuts: look for jets with px > 20 GeV in the opposite 
hemisphere, such that the candidate Higgs decay products are between the tagging jets, satisfying Eq. (|l^). This 
performs somewhat superior to merely choosing the two highest-pr jets. 

Also in contrast to our previous studies ||l^,|l^, the veto candidates are defined jets {pr > 20 GeV) anywhere 
between the tagging jets, i.e. they are searched for in a somewhat larger rapidity region than the W decay 
leptons (see Eq. (Kw, which have to be at least 0.7 units of rapidity away from the tagging jets. The choice of 



Eq. (18b) allows for more suppression of the backgrounds than the more restrictive selection. The resulting veto 
probabilities are summarized in line six of Table |. We emphasize that while these probabilities are estimates 
only, they can be independently determined at the LHC in processes like Zjj and Wjj production [|l8|,^ . 

For the tt + jets backgrounds, it is simpler instead to reinterpret the divergent higher-order cross sections. For 
this we assume that additional soft parton emission, which will be dominated by soft gluons, exponentiates like 
soft photon emission. This approximation has been shown well to describe multijet events at the Tevatron |13]. 
In this model, the probability to observe n soft jets in the veto region is given by a Poisson distribution with 

n = n{pT.vcto) = — / dpT3 -r-^ t (19) 

^2 Jp^.,„,„ dpT3 

where the unregularized three-parton cross section is integrated over the veto region of Eq. dlq) and then nor- 
malized to the 2-jet cross section a2, regarded as inclusive. We call this model the "exponentiation model". A 
rough estimate of the multiple emission probability is thus provided by 

Pe.p(pT,vcto) = 1 - Po = 1 - e-«(''— ) (20) 



TABLE II. Number of expected events for the Hjj signal, for 5 fb~ integrated luminosity and application 
of all efficiency factors and cuts, including a minijet veto, but for a range of Higgs boson masses. The total 
background is 8.1 events. As a measure of the Poisson probability of the background to fluctuate up to the signal 
level, the second line gives crcaMss, the number of Gaussian equivalent standard deviations. 



ran 


115 


120 


130 


140 


150 


160 


170 


180 


190 


200 


no. events 

CGauss 


2.0 
0.5 


3.6 
1.0 


8.8 
2.6 


15.8 
4.4 


24.0 
6.3 


37.5 
9.0 


36.3 

8.8 


29.9 

7.5 


20.8 
5.5 


16.3 

4.5 



which is the probability to veto the event. We find veto survival probabilities of Psurv = 46% for ti events and 
Psurv = 12% for ttj events. Both of these results disagree with our other estimates of Psurv for QCD processes. 
This may be understandable for ti events, as at tree level this component does not contain any t-channel gluon 
exchange processes, which all of the other QCD backgrounds do. We also observe that the additional radiation 
in ti events typically falls outside the central gap. We did not explore this any further as the ti component is 
negligible. That the value of Psurv found for tij events is so much smaller than that for other QCD backgrounds 
may be understood because the large mass of the tops produced requires significant pr, but as this is not yet fully 
explored we prefer to remain conservative and assume the value Psurv = 0.29 for all QCD backgrounds, including 
top quark pair production. For a Higgs boson mass of 160 GeV we are left with a signal cross section of 7.5 fb 
compared to a total background of 1.6 fb. 

So far we have considered a single Higgs boson mass of 160 GeV only. Since we have largely avoided mass- 
specific cuts, we can immediately extend our results to a larger range of mn- The expected number of signal 
events for 115 GeV < niH < 200 GeV and an integrated luminosity of 5 fb"-'^ are shown in Table |l|. For the same 
luminosity, 8.1 background events are expected. In the second row of Tabic O the Poisson probabilities for this 
background to fluctuate up to the signal level are given, in terms of the equivalent Gaussian significances which 
can be expected in the experiment on average. 

V. DISCUSSION 

While we have demonstrated a series of kinematic cuts and a minijet veto, based on the physics of the processes 
involved, that reduce the background well below the level of the signal, we have not performed a detailed detector 
simulation. A full simulation is clearly needed eventually, but we do not expect our results to change dramatically 
since CMS and ATLAS will be highly efficient detectors. For a more realistic estimate of the signal significance we 
do take one major reduction factor into account in the last line of Table |, the reconstruction efficiency for tagging 
jets. This is expected to be 0.86 at CMS for each tagging jet, resulting in a net efhciency of etag = (0.86)^ = 0.74. 
Even after this nontrivial loss of total rate, it is clear that the method we propose still works beautifully. 

As the H -^ WW mode is likely to be the discovery channel for the mass range 130 GeV < niH < 200 GeV, 
we wish to be able to reconstruct the Higgs boson mass. At threshold, the two (virtual) W's are at rest in the 
Higgs boson center-of-mass frame, resulting in mef_i = m^p, so we can calculate the transverse energy of both the 
charged lepton and invisible neutrino systems. 



En,. = sJp't., + m% , pT = V^T + K^ ■ (21) 

Using these results for the transverse energies, we may compute a transverse mass of the dilepton-^T system. 



Mt„„ = V (^^ + EtJ^ - (PT., +li>T)\ (22) 

At threshold this is exactly the Higgs boson transverse mass. Below threshold, the relation m^^ — m^i, is still 
an excellent approximation, while above threshold it begins to lose validity as the W bosons acquire a non-zero 
velocity in the Higgs boson rest frame. But even at mn = 200 GeV this "pseudo" transverse mass remains 
extremely useful for mass reconstruction. We show the dramatic results in Fig. ^ for Higgs boson masses of 130, 
160 and 190 GeV. Clearly visible is the Jacobian peak at Mt„„ = ran, in particular for ran = 160 GeV. The 
combined backgrounds are added to the Higgs signal, and are shown after application of all cuts and detector 
efficiencies, as well as both the b and minijet vetoes which were discussed in the previous Sections. 
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FIG. 4. Dilepton-j^T transverse mass distributions expected for a Higgs boson of mass mn = 130, 160, and 
190 GeV (red) after the cuts of Eqs. (|l0|jlj) and application of all detector efficiencies and a minijet veto with 
PT,voto = 20 GeV. Also shown is the background only (dashed). 



As the Mt distribution shows a pronounced Jacobian peak for all Higgs boson masses under consideration, the 
signal-to-background ratio and the statistical significance of the signal can be improved by restricting attention 
to the relevant Mt range. We find that up to VF-pair threshold {Mh < 165 GeV), the requirement Mt < 
Mh -\- 20 GeV works well. Above threshold, loosening this cut improves the signal significance. Table III 
demonstrates the power of this constraint, showing that it is possible to isolate a virtually background free 
qq -^ qqH, H -^ WW signal at the LHC, with sufficiently large counting rate to obtain a better than 5a signal 
with a mere 5 fb~^ of data for the mass range 140-200 GeV. Extending the observability region down to 130 GeV 
requires at most 15 fb^^. To reach 120 GeV would require ~ 65 fb~^ of low luminosity data (10^"^ cm^^ s~^), 
and to reach 115 GeV would require « 165 fb~^. This nicely overlaps the regions of observability for H — > 77 
(100-150 GeV) [Q and H ^ tt (110-140 GeV) ||l^]. The luminosity requirements in the low mass region can be 
reduced significantly by more stringent charged lepton cuts. 

The high purity of the signal is made possible because the weak boson fusion process, together with the 
H -^ W^W~ — > e^jX^^T decay, provides a complex signal with a multitude of characteristics which distinguish 



TABLE III. Number of expected events for the Hjj signal, for 5 fb^ integrated luminosity and a range of 
Higgs masses. As compared to Table O an additional upper limit on the Higgs transverse mass of Eq. (p3) is 
imposed, as given in the first line. The number of both signal and background events are shown, as well as S/B. 
The Poisson probability of the background to fluctuate up to the signal level is given in terms of crcauss, the 
number of Gaussian equivalent standard deviations. 



m.H (GeV) 


115 


120 


130 


140 


150 


160 


170 


180 


190 


200 


Mt cutoff (GeV) 


135 


140 


150 


160 


170 


180 


210 


220 


none 


none 


no. S events 


1.9 


3.4 


8.3 


14.8 


22.7 


36.5 


35.9 


29.3 


20.8 


16.3 


no. B events 


3.0 


3.4 


4.0 


4.7 


5.4 


6.0 


7.2 


7.5 


8.1 


8.1 


S/B 


0.6 


1.0 


2.0 


3.1 


4.2 


6.1 


5.0 


3.9 


2.6 


2.0 


OGauss 


0.8 


1.4 


3.1 


5.0 


6.8 


9.6 


9.0 


7.6 


5.5 


4.5 
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it from the various backgrounds. The basic feature of the qq -^ qqH signal is the presence of two forward tagging 
jets inside the acceptance region of the LHC detectors, of sizable pxj and of dijet invariant mass in the TeV 
range. Typical QCD backgrounds, with isolated charged leptons and two hard jets, are much softer. In addition, 
the QCD backgrounds are dominated by W bremsstrahlung off forward scattered quarks, which give typically 
higher-rapidity charged leptons. In contrast, the EW processes give rise to quite central leptons, and this includes 
not only the Higgs signal but also EW WWjj and ttjj production, which also proceed via weak boson fusion. 
It is this similarity that prevents one from ignoring EW analogs to background QCD processes, which a priori 
are smaller by two orders of magnitude in total cross section, but after basic cuts remain the same size as their 
QCD counterparts. 

For H —f WW decays, lepton angular distributions are extremely useful for reducing the backgrounds even 
further. The anti-correlation of W spins in H decay forces the charged leptons to be preferentially emitted in 
the same direction, close together in the lego plot. This happens for a small fraction of the background only. 
We have identified the most important distributions for enhancing the signal relative to the background, and 
set the various cuts conservatively to avoid bias for a certain Higgs boson mass range. There is ample room for 
improvement of our results via a multivariate analysis of a complete set of signal and background distributions, 
which we encourage the LHC collaborations to pursue. Additional suppression of the tt + jets background may 
be possible with b identification and veto in the px^ < 20 GeV region. 

In addition to various invariant mass and angular cuts, we can differentiate between the W^s of the signal and 
W, t backgrounds and the real t's in the QCD and EW ttjj backgrounds. This is possible because the high energy 
of the produced t's makes their decay products almost collinear. Combined with the substantial pr of the t^t^ 
system this allows for r-pair mass reconstruction. The W decays do not exhibit this coUinearity due to their large 
mass, thus the angular correlation between the "^t vector and the charged lepton momenta is markedly different. 
Our real-T rejection makes use of these differences and promises to virtually eliminate the ttjj backgrounds. 

We advocate taking advantage of an additional fundamental characteristic of QCD and EW processes. Color- 
singlet exchange in the t-channel, as encountered in Higgs boson production by weak boson fusion (and in the 
EW Zjj background), leads to additional soft jet activity which differs strikingly from that expected for the 
QCD backgrounds in both geometry and hardness: gluon radiation in QCD processes is typically both more 
central and harder than in WBF processes. We exploit this radiation, via a veto on events with central minijets 
of pt > 20 GeV, and expect a typical 70% reduction in QCD backgrounds and about a 25% suppression of EW 
backgrounds, but only about a 10% loss of the signal. 

Beyond the possibility of discovering the Higgs boson in the H -^ WW mode, or confirmation of its existence, 
measuring the cross sections in both weak boson and gluon fusion will be important both as a test of the Standard 
Model and as a search for new physics. For such a measurement, via the analysis outlined in this paper, minijet 
veto probabilities must be precisely known. For calibration purposes, one can analyze Zjj events at the LHC. 
The production rates of the QCD and EW Zjj events can be reliably predicted and, thus, the observation of 
the Z ^ £i peak allows for a direct experimental assessment of the minijet veto efficiencies, in a kinematic 
configuration very similar to the Higgs signal. 

Observation of SM H -^ e^fjr^^T at the LHC is possible for very low integrated luminosities, if the Higgs boson 
lies in the mass range between about 130 and 200 GeV. Weak boson fusion at the LHC will be an exciting process 
to study, for a weakly coupled Higgs sector just as much as for strong interactions in the symmetry breaking 
sector of electroweak interactions. 
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